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APPLICANT : ICARD - LIEPKALNS , 
MALLET, Jacques 
RAVASSARD, Philippe 

TITLE OF INVENTION: POLYPEPTIDES OF THE "BASIC- HELIX - LOOP - HELIX " bHLH 
FAMILY; 

CORRESPONDING NUCLEIC ACID SEQUENCES 
FILE REFERENCE: ST96042AUS 

CURRENT APPLICATION NUMBER: US/09/59 5 , 94 7C 
CURRENT FILING DATE: 2000-06-16 
PRIOR APPLICATION NUMBER: FR96/15651 
PRIOR FILING DATE: 1996-12-19 
PRIOR APPLICATION NUMBER: PCT/FR97/02 36 8 
PRIOR FILING DATE: 1997-12-19 
PRIOR APPLICATION NUMBER: US09/331,356 
PRIOR FILING DATE: 1997-12-19 
NUMBER OF SEQ ID NOS : 2 8 
SOFTWARE: PatentIn version 3 
SEQ ID NO: 1 
LENGTH: 1460 
TYPE : DNA 

ORGANISM : Rattus norvegicus 
SEQUENCE : 1 
gcaggtagcg agaggagcag 
gcagcccggc aggcacgctc 
cgattagcag ctcagaagtc 

ccgagcttct ttgctgcctc cagacgcaat ttactccagg cgagggcgcc 

aggggttcag ctatccaccg ctgcttgact 
gcccggagta actaggtaac atttaggaac 
gcgtactcta gtcccgcgtg gagtgacctc 
ttttttccca acctcaggat ggcgcctcat 
caagagaccc agcaaccctt tcccggagcc 
accccaccta gccccactct cgtaccgagg 
cgagggacat cgaggaagct ccgtgcgcgg 



,0 



tccctgggcc cccgttgctg attggcccgt ggcacaggca 
ctggtccggg cagagcagat aaagcgtgcc aggggacaca 
cctctgggtc tcaccactgc acagaggccg aggaccccct 



caaaacttcg aagcgagcag 
gcagctctct gttcttttga 
tagaagaggg gagtgggtgg 
actgtcacac cccccttcca 
cgcccaccat ccaagtgtcc 
aagtgctcag ttccaattcc 
aagcagaagc aggtgactgc 



gcaacaggcc caagagcgag ttggcactga gcaagcagcg acgaagccgg 
ccaacgaccg ggagcgcaac cgcatgcaca accttaactc cgcgctggat 
gtgtcctgcc caccttcccg gatgacgcca aacttacaaa gatcgagacc ctgcgcttcg 



tgcagctcag 
ctgaccaccc 
ctccaaaggg 
taagtcagag 
cccttggatg 
tcggaccacg 
gactgctccg 
cgcggagggc 
cgcaagaagg 
gcgctgcgcg 



cccacaacta catttgggca 
gccccgagcc ccctgtgccc 
actggggctc tatctactcc 
tggaggagtt ccctggcctg 
tggtgttctc agacttcttg 
aaagggaggg agtcagagct 
cccttctggc tttcattagt 



ctgactcaga cgctgcgcat agcggaccac agcttctacg 

tgtggggagc tgggaagccc gggagggggc tccagcggcg 

ccagtttccc aagctggtag cctgagcccc acagcctcat 

caggtgccca gctccccatc ctgtctgctc ccgggcaccc 

tgaagggccc aaacaggccc tgggcggtgg gcgctggcag 

gtctgaaatg gaaggtagtg gaggcactcg agcatctcgc 

caggtccctg atttaaccag gattcgcaca gttccttgct 
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gctgtgcgtg cacaaaggac attgcaggct gatctcctct 
acctcaaact cccgctccaa gcagaggaga gccgtagcac 
atacttcctg gtgactccgc cctctttcaa atctgcgggc 
agagtgacct aatccagtgt 
SEQ ID NO: 2 
LENGTH: 24 
TYPE: PRT 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: peptide fragment of bHLH 
SEQUENCE: 2 

Ala Ala Thr Lys His Gly Met Gly lie Gly Ala 
15 10 
Asp Lys Cys Gly Cys Arg Tyr Gly 
20 

SEQ ID NO: 3 
LENGTH: 24 
TYPE : PRT 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: peptide fragment of bHLH 
SEQUENCE : 3 

Gly Gly Cys Ser Arg Asp Thr Tyr Thr Cys Ala 
15 10 
Tyr Asx Gly Ala Tyr Cys Thr Thr 
20 

SEQ ID NO: 4 
LENGTH: 2 5 
TYPE : DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 
SEQUENCE : 4 

aaccttaact ccgcgctgga tgcgc 
SEQ ID NO: 5 
LENGTH: 18 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 

SEQUENCE : 5 

cgcggtgtcc tgcccacc 

SEQ ID NO: 6 

LENGTH : 6 

TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: DNA sequence of E box 
SEQUENCE: 6 



taaccctcct cagtgtggcc 
taaatagttg ggagactccc 
ctccaaccac cgctttctcc 



1320 
1380 
1440 
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99 caggtg 6 

101 <210> SEQ ID NO: 7 

102 <211> LENGTH: 6 

103 <212> TYPE: DNA 

104 <213> ORGANISM: Artificial Sequence 

105 <220> FEATURE: 

106 <22 3> OTHER INFORMATION: DNA sequence of mutated E box 

107 <400> SEQUENCE: 7 

108 tccgtg 6 

110 <210> SEQ ID NO: 8 

111 <211> LENGTH: 214 

112 <212> TYPE: PRT 

113 <213> ORGANISM: Rattus norvegicus 

114 <400> SEQUENCE: 8 



115 Met Ala Pro His Pro Leu Asp Ala Pro Thr lie Gin Val Ser Gin Glu 

116 15 10 15 

117 Thr Gin Gin Pro Phe Pro Gly Ala Ser Asp His Glu Val Leu Ser Ser 

118 20 25 30 

119 Asn Ser Thr Pro Pro Ser Pro Thr Leu Val Pro Arg Asp Cys Ser Glu 

120 35 40 45 

121 Ala Glu Ala Gly Asp Cys Arg Gly Thr Ser Arg Lys Leu Arg Ala Arg 

122 50 55 60 

123 Arg Gly Gly Arg Asn Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 

124 65 70 75 80 

125 Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 

126 85 90 95 

127 His Asn Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 

128 100 105 110 

129 Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 

130 115 120 125 

131 His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 

132 130 135 140 

133 Ser Phe Tyr Gly Pro Glu Pro Pro Val Pro Cys Gly Glu Leu Gly Ser 

134 145 150 155 160 

135 Pro Gly Gly Gly Ser Ser Gly Asp Trp Gly Ser lie Tyr Ser Pro Val 

136 165 170 175 

137 Ser Gin Ala Gly Ser Leu Ser Pro Thr Ala Ser Leu Glu Glu Phe Pro 

138 180 185 190 

139 Gly Leu Gin Val Pro Ser Ser Pro Ser Cys Leu Leu Pro Gly Thr Leu 

140 195 200 205 

141 Val Phe Ser Asp Phe Leu 

142 210 



144 <210> SEQ ID NO: 9 

145 <211> LENGTH: 1330 

146 <212> TYPE: DNA 

147 <213> ORGANISM: Homo sapiens 

148 <400> SEQUENCE: 9 

149 cctcggaccc cattctctct tcttttctcc tttggggctg gggcaactcc caggcggggg 60 

150 cgcctgcagc tcagctgaac ttggcgacca gaagcccgct gagctcccca cggccctcgc 120 
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151 tgctcatcgc tctctattct tttgcgccgg tagaaaggta atatttggag gccttcgagg 180 

152 gacgggcagg ggaaagaggg atcctctgac ccagcggggg ctgggaggat ggctgttttt 240 

153 gttttttccc acctagcctc ggaatcgcgg actgcgccgt gacggactca aacttaccct 300 

154 tccctctgac cccgccgtag gatgacgcct caaccctcgg gtgcgcccac tgtccaagtg 360 

155 acccgtgaga cggagcggtc cttccccaga gcctcggaag acgaagtgac ctgccccacg 420 

156 tccgccccgc ccagccccac tcgcacaccg gggaactgcg cagaggcgga agagggaggc 480 

157 tgccgagggg ccccgaggaa gctccgggca cggcgcgggg gacgcagccg gcctaagagc 540 

158 gagttggcac tgagcaagca gcgacggagt cggcgaaaga aggccaacga ccgcgagcgc 600 

159 aatcgaatgc acgacctcaa ctcggcactg gacgccctgc gcggtgtcct gcccaccttc 660 

160 ccagacgacg cgaagctcac caagatcgag acgctgcgct tcgcccacaa ctacatctgg 720 

161 gcgctgactc aaacgctgcg catagcggac cacagcttgt acgcgctgga gccgccggcg 780 

162 ccgcactgcg gggagctggg cagcccaggc ggtccccccg gggactgggg gtccctctac 840 

163 tccccagtct cccaggctgg cagcctgagt cccgccgcgt cgctggagga gcgacccggg 900 

164 ctgctggggg ccacctcttc cgcctgcttg agcccaggca gtctggcttt ctcagatttt 960 
16 5 ctgtgaaagg acctgtctgt cgctgggctg tgggtgctaa gggtaaggga gagggaggga 1020 

166 gccgggagcc gtagagggtg gccgacggcg gcggccctca aaagcacttg ttccttctgc 1080 

167 ttctccctag ctgacccctg gccggcccag gcctccacgg gggcggtagg ctgggttcat 1140 

168 tccccggccc tccgagccgc gccaacgcac gcaacccttg ctgctgcccg cgcgaagtgg 1200 

169 gcattgcaaa gtgcgctcat tttaggcctc ctctctgcca ccaccccata atcccattca 1260 

170 aagaatacta gaatggtagc actacccggc cggagccgcc caccgtcttg ggtcgcccta 1320 

171 ccctcactca 1330 

173 <210> SEQ ID NO: 10 

174 <211> LENGTH: 214 

175 <212> TYPE: PRT 

176 <213> ORGANISM: Homo sapiens 

177 <400> SEQUENCE: 10 

178 Met Thr Pro Gin Pro Ser Gly Ala Pro Thr Val Gin Vai Thr Arg Glu 

179 15 10 15 

180 Thr Glu Arg Ser Phe Pro Arg Ala Ser Glu Asp Glu Val Thr Cys Pro 

181 20 25 30 

182 Thr Ser Ala Pro Pro Ser Pro Thr Arg Thr Pro Gly Asn Cys Ala Glu 

183 35 40 45 

184 Ala Glu Glu Gly Gly Cys Arg Gly Ala Pro Arg Lys Leu Arg Ala Arg 

185 50 55 60 

186 Arg Gly Gly Arg Ser Arg Pro Lys Ser Glu Leu Ala Leu Ser Lys Gin 

187 65 70 75 80 

188 Arg Arg Ser Arg Arg Lys Lys Ala Asn Asp Arg Glu Arg Asn Arg Met 

189 85 90 95 

190 His Asp Leu Asn Ser Ala Leu Asp Ala Leu Arg Gly Val Leu Pro Thr 

191 100 105 110 

192 Phe Pro Asp Asp Ala Lys Leu Thr Lys lie Glu Thr Leu Arg Phe Ala 

193 115 120 125 

194 His Asn Tyr lie Trp Ala Leu Thr Gin Thr Leu Arg lie Ala Asp His 

195 130 135 140 

196 Ser Leu Tyr Ala Leu Glu Pro Pro Ala Pro His Cys Gly Glu Leu Gly 

197 145 150 155 160 

198 Ser Pro Gly Gly Pro Pro Gly Asp Trp Gly Ser Leu Tyr Ser Pro Val 

199 165 170 175 

200 Ser Gin Ala Gly Ser Leu Ser Pro Ala Ala Ser Leu Glu Glu Arg Pro 
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180 185 190 

Gly Leu Leu Gly Ala Thr Ser Ser Ala Cys Leu Ser Pro Gly Ser Leu 

195 200 205 

Ala Phe Ser Asp Phe Leu 
210 

SEQ ID NO: 11 
LENGTH: 18 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 
SEQUENCE : 11 

caacgaccgg cagcgcaa 18 
SEQ ID NO: 12 
LENGTH: 2 4 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 
SEQUENCE: 12 

gcccagatgt agttgtgggc gaag 24 
SEQ ID NO: 13 
LENGTH: 6 0 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 
FEATURE : 

NAME/KEY: misc_feature 

OTHER INFORMATION: n=a or t or g or c 
SEQUENCE : 13 

W--> 2 35 atcgttgaga ctcgtaccag cagagtcacg agagagacta cacggtactg gnnnnnnnnn 6 0 

SEQ ID NO: 14 
LENGTH: 2 0 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE: 

OTHER INFORMATION: primer 
SEQUENCE: 14 

agacgacgcg aagctcacca 20 
SEQ ID NO: 15 
LENGTH: 24 
TYPE: DNA 

ORGANISM: Artificial Sequence 
FEATURE : 

OTHER INFORMATION: primer 
SEQUENCE: 15 

gctcaccaag atcgagacgc tgcg 24 
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Please Note; 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <22 0 
to <22 3> fields of each sequence which presents at least one n or Xaa. 

Seq#:13; N Pes . 52 , 53 , 54 , 55 ; 56 , 57 , 58 ; 59 , 60 
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